Real-time voice processing system for the loudspeakers under noisy environments
نویسندگان
چکیده
منابع مشابه
Real-time audio-visual voice activity detection for speech recognition in noisy environments
Voice activity detection (VAD) is one of the most critical issues on performance degradation of speech recognition in noisy environment applications. A real-time VAD was developed by using face parameters (eye and lip contours) as a front-end for the traditional speech and noise (audio) GMMbased method. Speech recognition performance of the audiovisual VAD is shown to be comparable with audio-o...
متن کاملA hierarchical model for processing noisy and partial information in large-scale real-time task environments
In this paper we introduce the Incremental Distributed Dispatcher Manager (IDDM) that is designed to handle control problems with large numbers of tasks and cooperative agents where only partial and noisy information is available. The IDDM is a modification of the DDM model that was developed to solve similar problems but in environments where accurate information is available. There were a num...
متن کاملVoice activity detection in noisy environments
The subject of this paper is robust voice activity detection (VAD) in noisy environments, especially in car environments. We present a comparison between several frame based VAD feature extraction algorithms in combination with different classifiers. Experiments are carried out under equal test conditions using clean speech, clean speech with added car noise and speech recorded in car environme...
متن کاملVoice-Driven Computer Game in Noisy Environments
The paper describes the performance of a task-oriented continuous automatic speech recognition (ASR) system in the computer game interface in noisy conditions. First, the process of designing the ASR system for Polish, based on CMU Sphinx4, is presented. Then, the concept of the computer game called Rally Navigator is described. The experiments were first run for the clean speech, and then repe...
متن کاملA model based voice activity detector for noisy environments
This paper presents a model-based voice activity detector (VAD) aimed at operating in low signal to noise ratio conditions and non-stationary noise environments. The proposed system makes use of Gaussian mixture models trained on Mel Frequency Cepstral Coefficients extracted from noisy speech data. In addition, information from smoothed frame based log energy is used to augment the system to de...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: The Japanese Journal of Ergonomics
سال: 2020
ISSN: 0549-4974,1884-2844
DOI: 10.5100/jje.56.1g1-03